Picture for Xiao Tan

Xiao Tan

World Models as Group Actions

Add code
May 23, 2026
Viaarxiv icon

Combating Visual Neglect and Semantic Drift in Large Multimodal Models for Enhanced Cross-Modal Retrieval

Add code
Apr 28, 2026
Viaarxiv icon

LoReC: Rethinking Large Language Models for Graph Data Analysis

Add code
Apr 20, 2026
Viaarxiv icon

Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

Add code
Mar 19, 2026
Viaarxiv icon

Speed3R: Sparse Feed-forward 3D Reconstruction Models

Add code
Mar 09, 2026
Viaarxiv icon

From Intuition to Investigation: A Tool-Augmented Reasoning MLLM Framework for Generalizable Face Anti-Spoofing

Add code
Mar 01, 2026
Viaarxiv icon

ERNIE 5.0 Technical Report

Add code
Feb 04, 2026
Viaarxiv icon

LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation

Add code
Nov 09, 2025
Figure 1 for LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Figure 2 for LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Figure 3 for LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Figure 4 for LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Viaarxiv icon

VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving

Add code
Nov 09, 2025
Viaarxiv icon

AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving

Add code
Nov 09, 2025
Figure 1 for AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving
Figure 2 for AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving
Figure 3 for AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving
Figure 4 for AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving
Viaarxiv icon